33 research outputs found
Optimal Experimental Design for Infinite-dimensional Bayesian Inverse Problems Governed by PDEs: A Review
We present a review of methods for optimal experimental design (OED) for
Bayesian inverse problems governed by partial differential equations with
infinite-dimensional parameters. The focus is on problems where one seeks to
optimize the placement of measurement points, at which data are collected, such
that the uncertainty in the estimated parameters is minimized. We present the
mathematical foundations of OED in this context and survey the computational
methods for the class of OED problems under study. We also outline some
directions for future research in this area.Comment: 37 pages; minor revisions; added more references; article accepted
for publication in Inverse Problem
A brief note on the Karhunen-Lo\`eve expansion
We provide a detailed derivation of the Karhunen-Lo\`eve expansion of a
stochastic process. We also discuss briefly Gaussian processes, and provide a
simple numerical study for the purpose of illustration.Comment: 14 pages. Fixed minor typos; added some reference
Efficient D-optimal design of experiments for infinite-dimensional Bayesian linear inverse problems
We develop a computational framework for D-optimal experimental design for
PDE-based Bayesian linear inverse problems with infinite-dimensional
parameters. We follow a formulation of the experimental design problem that
remains valid in the infinite-dimensional limit. The optimal design is obtained
by solving an optimization problem that involves repeated evaluation of the
log-determinant of high-dimensional operators along with their derivatives.
Forming and manipulating these operators is computationally prohibitive for
large-scale problems. Our methods exploit the low-rank structure in the inverse
problem in three different ways, yielding efficient algorithms. Our main
approach is to use randomized estimators for computing the D-optimal criterion,
its derivative, as well as the Kullback--Leibler divergence from posterior to
prior. Two other alternatives are proposed based on a low-rank approximation of
the prior-preconditioned data misfit Hessian, and a fixed low-rank
approximation of the prior-preconditioned forward operator. Detailed error
analysis is provided for each of the methods, and their effectiveness is
demonstrated on a model sensor placement problem for initial state
reconstruction in a time-dependent advection-diffusion equation in two space
dimensions.Comment: 27 pages, 9 figure
On Bayesian A- and D-optimal experimental designs in infinite dimensions
We consider Bayesian linear inverse problems in infinite-dimensional
separable Hilbert spaces, with a Gaussian prior measure and additive Gaussian
noise model, and provide an extension of the concept of Bayesian D-optimality
to the infinite-dimensional case. To this end, we derive the
infinite-dimensional version of the expression for the Kullback-Leibler
divergence from the posterior measure to the prior measure, which is
subsequently used to derive the expression for the expected information gain.
We also study the notion of Bayesian A-optimality in the infinite-dimensional
setting, and extend the well known (in the finite-dimensional case) equivalence
of the Bayes risk of the MAP estimator with the trace of the posterior
covariance, for the Gaussian linear case, to the infinite-dimensional Hilbert
space case.Comment: 16 pages, minor changes, corrected typo
A-optimal design of experiments for infinite-dimensional Bayesian linear inverse problems with regularized -sparsification
We present an efficient method for computing A-optimal experimental designs
for infinite-dimensional Bayesian linear inverse problems governed by partial
differential equations (PDEs). Specifically, we address the problem of
optimizing the location of sensors (at which observational data are collected)
to minimize the uncertainty in the parameters estimated by solving the inverse
problem, where the uncertainty is expressed by the trace of the posterior
covariance. Computing optimal experimental designs (OEDs) is particularly
challenging for inverse problems governed by computationally expensive PDE
models with infinite-dimensional (or, after discretization, high-dimensional)
parameters. To alleviate the computational cost, we exploit the problem
structure and build a low-rank approximation of the parameter-to-observable
map, preconditioned with the square root of the prior covariance operator. This
relieves our method from expensive PDE solves when evaluating the optimal
experimental design objective function and its derivatives. Moreover, we employ
a randomized trace estimator for efficient evaluation of the OED objective
function. We control the sparsity of the sensor configuration by employing a
sequence of penalty functions that successively approximate the
-"norm"; this results in binary designs that characterize optimal
sensor locations. We present numerical results for inference of the initial
condition from spatio-temporal observations in a time-dependent
advection-diffusion problem in two and three space dimensions. We find that an
optimal design can be computed at a cost, measured in number of forward PDE
solves, that is independent of the parameter and sensor dimensions. We
demonstrate numerically that -sparsified experimental designs obtained
via a continuation method outperform -sparsified designs.Comment: 27 pages, accepted for publication in SIAM Journal on Scientific
Computin
Monte Carlo Estimators for the Schatten p-norm of Symmetric Positive Semidefinite Matrices
We present numerical methods for computing the Schatten -norm of positive
semi-definite matrices. Our motivation stems from uncertainty quantification
and optimal experimental design for inverse problems, where the Schatten
-norm defines a design criterion known as the P-optimal criterion. Computing
the Schatten -norm of high-dimensional matrices is computationally
expensive. We propose a matrix-free method to estimate the Schatten -norm
using a Monte Carlo estimator and derive convergence results and error
estimates for the estimator. To efficiently compute the Schatten -norm for
non-integer and large values of , we use an estimator using a Chebyshev
polynomial approximation and extend our convergence and error analysis to this
setting as well. We demonstrate the performance of our proposed estimators on
several test matrices and through an application to optimal experimental design
of a model inverse problem.Comment: 21 pages, 10 figures, 1 tabl
Goal-Oriented Optimal Design of Experiments for Large-Scale Bayesian Linear Inverse Problems
We develop a framework for goal-oriented optimal design of experiments
(GOODE) for large-scale Bayesian linear inverse problems governed by PDEs. This
framework differs from classical Bayesian optimal design of experiments (ODE)
in the following sense: we seek experimental designs that minimize the
posterior uncertainty in the experiment end-goal, e.g., a quantity of interest
(QoI), rather than the estimated parameter itself. This is suitable for
scenarios in which the solution of an inverse problem is an intermediate step
and the estimated parameter is then used to compute a QoI. In such problems, a
GOODE approach has two benefits: the designs can avoid wastage of experimental
resources by a targeted collection of data, and the resulting design criteria
are computationally easier to evaluate due to the often low-dimensionality of
the QoIs. We present two modified design criteria, A-GOODE and D-GOODE, which
are natural analogues of classical Bayesian A- and D-optimal criteria. We
analyze the connections to other ODE criteria, and provide interpretations for
the GOODE criteria by using tools from information theory. Then, we develop an
efficient gradient-based optimization framework for solving the GOODE
optimization problems. Additionally, we present comprehensive numerical
experiments testing the various aspects of the presented approach. The driving
application is the optimal placement of sensors to identify the source of
contaminants in a diffusion and transport problem. We enforce sparsity of the
sensor placements using an -norm penalty approach, and propose a
practical strategy for specifying the associated penalty parameter.Comment: 25 pages, 13 figure
Mean-variance risk-averse optimal control of systems governed by PDEs with random parameter fields using quadratic approximations
We present a method for optimal control of systems governed by partial
differential equations (PDEs) with uncertain parameter fields. We consider an
objective function that involves the mean and variance of the control
objective, leading to a risk-averse optimal control problem. To make the
problem tractable, we invoke a quadratic Taylor series approximation of the
control objective with respect to the uncertain parameter. This enables
deriving explicit expressions for the mean and variance of the control
objective in terms of its gradients and Hessians with respect to the uncertain
parameter. The risk-averse optimal control problem is then formulated as a
PDE-constrained optimization problem with constraints given by the forward and
adjoint PDEs defining these gradients and Hessians. The expressions for the
mean and variance of the control objective under the quadratic approximation
involve the trace of the (preconditioned) Hessian and are thus prohibitive to
evaluate. To address this, we employ trace estimators that only require a
modest number of Hessian-vector products. We illustrate our approach with two
problems: the control of a semilinear elliptic PDE with an uncertain boundary
source term, and the control of a linear elliptic PDE with an uncertain
coefficient field. For the latter problem, we derive adjoint-based expressions
for efficient computation of the gradient of the risk-averse objective with
respect to the controls. Our method ensures that the cost of computing the
risk-averse objective and its gradient with respect to the control, measured in
the number of PDE solves, is independent of the (discretized) parameter and
control dimensions, and depends only on the number of random vectors employed
in the trace estimation. Finally, we present a comprehensive numerical study of
an optimal control problem for fluid flow in a porous medium with uncertain
permeability field.Comment: 27 pages. Minor revisions. Accepted for publication in SIAM/ASA
Journal on Uncertainty Quantificatio
Variance-based sensitivity analysis for time-dependent processes
The global sensitivity analysis of time-dependent processes requires
history-aware approaches. We develop for that purpose a variance-based method
that leverages the correlation structure of the problems under study and
employs surrogate models to accelerate the computations. The errors resulting
from fixing unimportant uncertain parameters to their nominal values are
analyzed through a priori estimates. We illustrate our approach on a harmonic
oscillator example and on a nonlinear dynamic cholera model.Comment: 28 Pages; revised version; accepted for publication in Reliability
Engineering & System Safet
A Fast and Scalable Method for A-Optimal Design of Experiments for Infinite-dimensional Bayesian Nonlinear Inverse Problems
We address the problem of optimal experimental design (OED) for Bayesian
nonlinear inverse problems governed by PDEs. The goal is to find a placement of
sensors, at which experimental data are collected, so as to minimize the
uncertainty in the inferred parameter field. We formulate the OED objective
function by generalizing the classical A-optimal experimental design criterion
using the expected value of the trace of the posterior covariance. We seek a
method that solves the OED problem at a cost (measured in the number of forward
PDE solves) that is independent of both the parameter and sensor dimensions. To
facilitate this, we construct a Gaussian approximation to the posterior at the
maximum a posteriori probability (MAP) point, and use the resulting covariance
operator to define the OED objective function. We use randomized trace
estimation to compute the trace of this (implicitly defined) covariance
operator. The resulting OED problem includes as constraints the PDEs
characterizing the MAP point, and the PDEs describing the action of the
covariance operator to vectors. The sparsity of the sensor configurations is
controlled using sparsifying penalty functions. We elaborate our OED method for
the problem of determining the sensor placement to best infer the coefficient
of an elliptic PDE. Adjoint methods are used to compute the gradient of the
PDE-constrained OED objective function. We provide numerical results for
inference of the permeability field in a porous medium flow problem, and
demonstrate that the number of PDE solves required for the evaluation of the
OED objective function and its gradient is essentially independent of both the
parameter and sensor dimensions. The number of quasi-Newton iterations for
computing an OED also exhibits the same dimension invariance properties.Comment: 30 pages; minor revisions; accepted for publication in SIAM Journal
on Scientific Computin